Learning to Identify Ambiguous and Misleading News Headlines
نویسندگان
چکیده
Accuracy is one of the basic principles of journalism. However, it is increasingly hard to manage due to the diversity of news media. Some editors of online news tend to use catchy headlines which trick readers into clicking. These headlines are either ambiguous or misleading, degrading the reading experience of the audience. Thus, identifying inaccurate news headlines is a task worth studying. Previous work names these headlines “clickbaits” and mainly focus on the features extracted from the headlines, which limits the performance since the consistency between headlines and news bodies is underappreciated. In this paper, we clearly redefine the problem and identify ambiguous and misleading headlines separately. We utilize class sequential rules to exploit structure information when detecting ambiguous headlines. For the identification of misleading headlines, we extract features based on the congruence between headlines and bodies. To make use of the large unlabeled data set, we apply a co-training method and gain an increase in performance. The experiment results show the effectiveness of our methods. Then we use our classifiers to detect inaccurate headlines crawled from different sources and conduct a data analysis.
منابع مشابه
Contrastive Analysis of Political News Headlines Translation According to Berman’s Deformative Forces
The present research aimed at investigating the deformation of political news headlines translation between English and Persian News Agencies based on Berman`s deformative system. For this purpose, 100 news headlines in English were selected from BBC, Reuters, Associated Press, France, France 24, Financial Times, Business Times, New York Times, Politico, Guardian, CNN, Bloomberg, Middle East Ey...
متن کاملThe effects of subtle misinformation in news headlines.
Information presented in news articles can be misleading without being blatantly false. Experiment 1 examined the effects of misleading headlines that emphasize secondary content rather than the article's primary gist. We investigated how headlines affect readers' processing of factual news articles and opinion pieces, using both direct memory measures and more indirect reasoning measures. Expe...
متن کاملMachine Learning Approach to Augmenting News Headline Generation
In this paper, we present the HybridTrim system which uses a machine learning technique to combine linguistic, statistical and positional information to identify topic labels for headlines in a text. We compare our system with the Topiary system which, in contrast, uses a statistical learning approach to finding topic descriptors for headlines. The Topiary system, developed at the University of...
متن کاملMisinformation in News Coverage of Professional and College Athlete Musculoskeletal Ailments
Background: The general population’s understanding of musculoskeletal health is likely influenced by media reports of the ailments of prominent athletes. We assessed factors independently associated with debatable or potentially misleading medical statements in mainstream sports media coverage of the ailments of professional and college athletes.Methods: We identified and assessed 200 Int...
متن کاملHelping News Editors Write Better Headlines: A Recommender to Improve the Keyword Contents & Shareability of News Headlines
We present a software tool that employs state-ofthe-art natural language processing (NLP) and machine learning techniques to help newspaper editors compose effective headlines for online publication. The system identifies the most salient keywords in a news article and ranks them based on both their overall popularity and their direct relevance to the article. The system also uses a supervised ...
متن کامل